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Introduction 

The SDS 940 SNOBOlA system will accept programs written 
in a language which is basically compatible with a subset of 
Bell Labs' November 22, 1967 version of SNOBOlA. SNOBOlA is 
not a superset of SN0B0L3 but is in most ways very similar 
to SN0B0L3. The major exception is in pattern matching and 
the pattern datatype. The SWOBOlA system permits programs to 
be created, run and debugged interactively. 

The principal data object in the SNOBOL language is a 
string of characters. The language permits building up longer 
strings from shorter strings through concatenations. In 
addition, through pattern matching, strings can have their 
contents tested and have the matched substrings assigned to 
string variables. 

Other features of the. language are arithmetic on integer 
strings, built-in functions for general use, and programmer 
defined functions which may have local variables and can be 
recursive to arbitrary depth. Input-output from files is 
provided as well as from the teletype. 
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StJOBOlA Program 

A SNOBOlA program is a set of statements, each involving 
a rule. A set of rules provides the means for manipulating 
strings and other data objects. Each statement of a prograia 
is written only with printable characters, but the contents 
of the data strings can be any 8-bit characters. The & character 
is reserved as an escape character for entering non-printable 
characters literally into the source program. To enter an & 
in a source program use && (see section on special operations). 

Characters in the language 

blank I "#$^&' ( )* + , - . /pl23h3(>7Q9: |<«>? 

#ABCDlPGHIJKI,MNOPQHSfUVWXIZC\ ] t«- 
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Strlngs 



A string is a sequence of 8 bit characters ordered from 
left to right (see special operations for entering non-printable 
characters). A string may be represented literally in the language 
by surrounding its contents by a pair of single quotes or 
double quotes. When one kind of quote is used^ only the other 
kind may appear within the literal string. A fundamental 
property of a string is its length. In particular, the string 
of length exists sind is called the null string. It can appear 
literally as " or "". 

The string which contains the digits in order from ji to 9 
can be literally written as 

•)Z)l23i»'56789' or "0123^^56789" 
These are legal " ' " These are illegal ' ' ' 



"'/FILENAME'" 
This is a string of length P 

1 
2 

3 
These strings have different contents 
One contains AB, the other contains BA. 







'CAN'T' 


t 1 




the null string 


'X' 




the string containing X 


•PQ' 




the string containing PQ. 


'*1:' 




the string containing *1: 


ents: 


'AB' 


, 'BA' 



o 

Names and Variables 

Names in the SNOBOlA- language may be of any length (up to 
kO^ characters). The first character must be a letter or ®. 
Each of the remaining characters must be a ., letter, or digit. 
The @ is intended for keyword names (see section on keywords). 
Variables in the language are those things which are given a 
name and have strings, patterns, or some other data object as 
their contents. 

These are legal names These are illegal names 

X i 

STRING lABC 

©ANCHCB AT® 

A.LONG.NAME .NAME 
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String Assignment 

In a string processing language it is necessary to store 
strings, to build up longer strings, to test strings for their 
contents, and to take strings apart. The storing of a string is 
specified by an assignment rule of one of the following forms: 

STRINGNAME '=' LITERAL STRING 
STRINGNAME '=' STRINGNAME 
STRINGNAME '=' 

Blanks around the = are not necessary, but all other binary 
operators in the SNOBOLU language require blanks on both sides. 
The third example is semantically equivalent to the first with 
a null string, ", on the right-hand side. Names \rtiich have 
not been assigned a value contain the null string. The above 
rules say to take the contents of the strings on the right-hand 
side and store them in the string variable ^rtiose name is given 
on the left hand side. 

Examples: 

STRING = 'THING' 

ALPHABET = 'ABCDEFGHIJKIMNOPQRSTUVWXYZ' 

LETTERS = ALPHABET 

NULL = 
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Concatenation 

Building iip longer strings can be specified by concatenation 
(or juxtaposition). Any number of strings may be concatenated 
to produce one long string. The operation is denoted by a space 
between each of the parts to be concatenated (sometimes parentheses 
are required to denote the range of the concatenation). Thus, 
to store the results of a concatenation into a string variable, 
simply use an assignment rule with the concatenation appearing 
on the right-hand side. 

Assxjme the following axe executed in order: 

A = 'ALPHA' A would contain the characters ALPHA 

B = 'BETA' B would contain the characters BETA 

D * A B C D woTild contain the characters ALPHABETA 

since C is assumed to be the n\all string, 
A = B A would contain the characters BETA 

B = B would contain the null string. 
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Simple Pattern Matching 

It is often desirable to know if one string is contained 
somewhere within another string. A test of this type is denoted 
by a rule of the form: STRING ' ' STRING. That is, the string 
to be tested (the subject string), followed by a blank (or 
blanks), followed by the string to be searched for in the subject 
string (the object string). The possible confusion between 
pattern matching and concatenation is avoided by the fact that 
the subject string must be the first string in the statement 
and be immediately followed by another string, the object string, 
with a separating blank (or blanks). If the subject string is 
to be a concatenation of other strings, then the concatenation 
must be surrounded by parentheses. If the object string is to 
be a concatenation, it does not have to have surrounding 
parentheses. If the object string is found anywhere in the 
subject string, i.e., it is a substring of the subject string, 
then the pattern match succeeds , otherwise, the pattern match 
fails . 

Each pf the following statements indicate pattern matching 
is to be done. 

NAMEl NAME2 

NAMEl NAME2 NAME 3 mmk 

NAME 'STRING' 

'STRING' NAME 

(NAMEl 'STRINGl') "STRING2" NAME2 
Assume X = 'AB' Y = 'ABC' Z = 'ABCD' 
The following pattern matches succeed. 

X 'A' 

X 'B' 

X 'AB' 

X X 

y X 

Z X 'CD' 

Z Y 

(X Y) "BAB" 

•BABABCABCDC' X Y Z 
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The following pattern matches fail 

X 'X' 

X Y 

X Z 
'A' Y 
(X Y) 'AB' Z 
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Labels 

Any statement in a SN0B0L1+ program may be labeled. A 
statement is labeled if there is a character in the first 
character position (except '*•). The label is all the characters 
up to the first blank. If a statement is to be unlabeled, the 
first character position must be blank. The purpose of the 
label is to give a name to the statement so that it may be 
referred to easily. ERD, RETURN, FRETURN may not be used as 
labels since they are reserved for special purposes. 

The following statements are labeled FIRST, LOOP and NAMEl. 

FIRST ALPHAMUMERIC = ©ALPHABET ©DIGITS 
LOOP X = INPUT 
NAMEl NAME2 NAME3 

The following statements are not labeled. 

A = 10 

NAMEl NAME2 NAME3 
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The Go-To Field 

The last field of a statement is called the go- to field. 
If it is not present then, after the current statement is executed, 
the statement below it will be executed. The field starts with 
a colon, :, (followed by amy nvunber of blanks). Only exit 



commands may follow the colon. 

exit commajids. 

' ( ' label name ' ) ' 
'S(' label name ')' 
'P(' label name ')' 



Below axe the three kinds of 

imconditional exit 
success exit 
failure exit 



If none of the exit commands are given after the colon, then 
the statement is treated like no colon was present. 

A statement fails (immediately) if any part of it fails, 
otherwise, it succeeds. If an unconditional exit is given, then 
the statement to be executed is given by the label name regardless 
of success or failure of the statement. Otherwise, the colon 
may be followed by a success exit, failure exit, or both 
(either order, blanks permitted between the commands). If a 
statement fails and it has a failure exit, then the next statement 
is given by that label name; similarly, if it succeeds and 
there is a success exit. Otherwise, the next statement to be 
executed is the following statement. An exit to END will 
terminate the execution of statements. 



The following are legal go- to fields. 



(loop) 

s(loop) f(doije) 
f(dowe) s(loop) 

S(HERE) 
F(NEXT) 



The next statement will be 



•ABC' 'B' 


:S(L1) F(L2) 


LI 


'AB' 'xy 


:S(L1) 


the following statement 


•AB' 'AB' 


:S(L1) 


LI 


X = Y 


: (AGAIN) 


AGAIN 


•X' 'Y' 


! (TOP) 


TOP 
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Slmple Pattern Matching Continued 

One method of statement failure is for a pattern match to 
fail. Using this fact, appropriate exit commands can he used 
to decide if one string is a substring of, another. If the 
subject string is given by name (i.e., it is not a concatenation 
or literal) then a successful pattern match can be followed by 
a replacement. The rule has the form: STRINGKAME STRING *=' STRING, 
where either STRING can be an arbitrary concatenation of strings. 
The subject string is searched from left to right for the first 
occurrence of the object string. If it is found, the part of 
the subject string matched by the object string is replaced by 
the string on the right hand side. If the pattern fails, no 
replacement is done since the statement fails immediately. For 
statements which contain pattern matching but no replacement 
field (i.e., no =) the subject string is not affected even if 
matching is successful. 

These statements result in TOPCARD = 'KING OF SPADES' and an 
exit to OUTIT. 

TOPCARD = 'ACE OF SPADES' 

TOPCARD 'ACE' = 'KING' :S(OUTIT)f(TRYAGAIN) 
These statements result in ST3 = 'ABBEXXXBBA' 

STl = 'AB' 

ST2 = 'BA' 

ST3 = 'ABBBABBABBA' 

ST3 STl ST2 = 'XXX' 
These statements result in TEST = ' ,PQR,WXY,KIM, ' 

LIST = ',PQR,WXY,KLM,XYZ, ' 

ELEMENT = 'X^' 

TEST = LIST 

TEST ',' ELEMENT ',' = ',' :S(SUCCESS)F(FAIL) 
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Fields of a Statement 

There are five fields to every statement. 

LABEL REFERENCE PATTERN REPLACEMENT GO-TO 

If the label field is missing, then the statement is unlabeled. 
If the reference field is missing, the pattern ajid replacement 

field must be missing. Thus, the statement is at most a qo tp 

MMMIMI' statement that will succeed. 
If the pattern field is missing, then the statement is at most an 

assignment statement. 
If the replacement field is missing, it is at most a pattern match 

without replacement. 
If the go-to field is missing, the following statement will 

be executed next. 

The following statements contain: 

1. Pattern match with replacement and an unconditional exit, 

2. Simple assignment, 

3. Unconditional branch, 

k. Pattern match with exit depending on success or failure 



O 





of 


the 


pattern match. 










Label 
Field 

LABEL 




Reference 

Field 


Pattern 

Field 

PAT 


Replacement 
Field 


Go -To 
Field 

:(GOTOHEXT) 


1. 


REi-'EK 


» REPLACE 


2. 






NAME 




= 'ABC' 




3. 


FINI 










: (END) 


k. 






SUBJECT 


OBJECT 




:S(S) F(P) 
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Teletype Input and Output 

INPUT, OUTPUT, INPUTC, OUTPUTC are special teletype input- 
output variables. Anytime the variable OUTPUT is assigned a 
string value its contents are printed. A carriage return and 
linefeed are supplied at the end of the string and after every 
72nd character printed on the teletype. Anytime the variable 
INPUT is used, its value will be collected from the teletype up 
to a carriage return, which is deleted from the string. 

INPUTC and OUTPUTC are used for character-oriented input 
and output rather than line input and output. INPUTC collects 
exactly one character from the teletype. OUTPUTC outputs its 
contents to the teletype when it is assigned a value. No 
carriage retiorns are supplied, that is, it outputs its contents 
literally (see special conventions concerning line input). 
Execution of the following will print the line "NOW IS" after 
the line "NOW" is typed in. 

OUTPUT = INPUT 'IS' 
After the following program is rxun. the teletype line will be 
"APPEND", "BREAK", or "CHANGE", or a character which is not 
A, B, or C followed by a ? . 

X = INPUTC 





X 'A' :S(A) 




X 'B' :S(B) 




X 'C :S(C) 




OUTPUT = '?' 


A 


OUTPUT = 'PPEHD' 


B 


OUTPUT = 'BEAK' 


C 


OUTPUT = 'HAHGE' 



/ ~\ 



(END) 
(END) 
(END) 
(END) 

The following statements will print "PROBLEM NUMBER" on the 
teletype and will pick up a response terminated by carriage 
return on the same line. 

OUTPUTC = 'PROBLEM NUMBER' 

NO. = INPUT 



Binary and Unary Operators 

There are many operators in the SNOBOL^)- language, e.g., 
+/ -•) *> I ) **; $*=>•>•• A binary operator requires a space 
on both sides of it (except for the binary operator space, as 
in concatenation, and the = operator) . A unary operator may 
not have a space between it axid its operand. Parentheses are 
not required for multiple unary operations (see precedence table), 

The following are legal statements: 
X=A * B 
A B = C 
X = $$$Y 

These are illegal: 
X = A* B 
A B .C 
X = $$$ Y 



,) 
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Arithmetlc 

A string is an integer if it is the null string (value p) 
or it is a string of digits with or without a leading + or -, 
and its absolute value is less than 2 ^-1. A literal string of 
digits may be written with or without surrounding quote marks. 
AritlHnetic on integers resiilts in integers with leading + signs 
and ^'s suppressed. If the value of an arithmetic operation is 
p, the result will be the string 'jZJ* . The binary operators 
+} -) *} I , ** are used for addition, subtraction, multiplication, 
division, and exponentiation, with the usual precedences prevailing. 
The unary operators +, - are used for plus and minus. Parentheses 
can be used as needed. An arithmetic operation will cause an 
error message if the resulting integer is too large, if division 
by P occurs, or If p is raised to a power < p. 

G?hese are integers 
'123' 
123 
+1 

These are legal statements 
X«(Y + 2 + l)**W 
OUTPUT =5 1 '2' 

These statements output the result of dividing X by Y. If 

Y is 0, it will output "INFINITE". The function NE, not equal, 

will be explained later. 

ANSWER = NE(Y,0) X / Y :S(0UTIT) 

INF OUTPUT = 'INFINITE' :(END) 

OUTIT OUTPUT = ANSWER :(END) 
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Indlrect Referencing 

A program may construct names by using the unary operator 
$ airplied to a string. The result is a name which is the same 
as the contents of the string. Indirect referencing may appear 
anywhere that a naae is legal (except in the label field). In 
the go-to field the resulting name should he a label. IMirect 
referencing can becoaie a remarkably powerful facility since it 
provides the ability to chsinge the names that are used in a 
statement between executions of that statement. It is important 
to note that names obtained by indirect referencing do not 
have to conform to the @, letters, digits, aad . rules for 
names appearing in the source language. 

If lAMEl = ' ALPHA', NAME2 «= •BETA',1AME3 = 'GAMMA', and UML = 'OK' 
then the following tvo statements ■would accomplish the same thing. 

ALPHA BETA = GAMMA :S(OK) 

$HAME1 $1AME2 = $MME3 :S($LABEL) 
If Y = 'A', A = 'B', B = 'C, C = 'D' then after the following 
statement is executed X will contain D. 

X = $$$Y 



-17- 
Grouping 

Parentheses are used for grouping parts of a statement 
together, e.g., in arithmetic operations. The subject string 
of a pattern match can be given by a grouping of a number 
of strings together, or the name of the subject string can be 
given by a $ applied to a grouping. In general, groupings can 
appear in any field of a statement except the label field. 

This is a legal statement. 

$(X Y Z) (A + B) / C = (A * B) ** 2 :S($(Al)) 
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Functions 



In most prograinning languages the idea of a function is 
perhaps the most powerful feature- A function will take some 
arguments and produce a result which depends on those arguments, 
A ftinction appears in the SKDBOlA language as a function ttame 
followed by a (, followed by a list of arguments separated by 
coimas, followed by a closing ')'. Null arguments are permissible 
and missing arguments are asstaaed to be null. SNOBOlA functions 
are recursive and the arguments are transmitted to the function 
by value (to be explained later) . fhroughout the SNOBOlA- 
language there are a number of pre-defined functions, e.g., 
SIZE(S), LE(I,J), LT(I,J), GE(I,J), GT(l,J), 1Q(I,J), lffi(l,j). 
SIZE returns the length of the string arguiaent. The others 
make comparisons between two integer string arguments. For 
example, IiT(I, J) returns the null string if I < J; otherwise, 
it fails. SNOBOrt functions may either succeed or fail. 
If a function succeeds, it will return a value (many times 
it is a null string) . A function may appear in any field of 
a statement (not the label field). It is a fatal error for 
a function to fail if it is in the go-to field. 

fhe following aye legal statements 

X = SIZE(Y) 

X « GT(Y,X) Y :S($('X' SIZE(y))) 
If X and Y are null strings, then the following function calls are 
equivalent. 

EQ(X, Y) 

EQ(,Y) 

EQ(X, ) 

EQ(X) 

mi,) 
mi ) 
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User Functions 

The user is permitted to define his own functions. There 
are four parts to the use of a user defined SWOBOL function. 

1. Defining the function, listing its formal arguments, its 
local variables, and the label of its starting statement. 

2. Calling the function with actual parameters. 

3. Executing the function. 

h. Returning from the function with a value or a failure 
return from the function, and in either case restoring 
saved values. 

A function is defined by executing a DEFINE function with 
appropriate arguments. The DEFINE function has two arguments. 
Tlie first argument is a string which contains the name of the 
function, followed by (', followed by a list of formal arguments 
(if any) separated by ,'s, followed by')', followed by a list 
of local variables (if any) separated by», 's. The second 
argument is a string which contains the label name of the first 
statement to be executed in the function. If it is null, 
the label name is assumed to be the same as the function name. 

The body of a function can be any of the statements of 
the program. The termination of a function is by an exit to 
RETURN or ERETURN. (RETURN & FRETURN cannot be user-defined 
labels . ) 

The call of a function is done when the part of the state- 
ment containing the fimction is evaluated (see order of evaluation). 
It appears in the source statement as a function name followed 
by an argument list in parentheses. The execution of the 
function is as follows. The actual arguments have been 
evaluated, i.e., all operations and function calls in the 
arguments have been completed, yielding actual argument values 

(of any datatype) to be assigned to the formal arguments. Then 

! 
the cixrrent contents of the variable whose name is the same as 

the function name is saved. Similarly, the values of the formal 

arguments and the local variables are saved in the order specified 

when the function was defined. Then for formal 
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axguments are given the values of the actual arguments. The 
assignments are done left to right; each actual arg\iment is 
assigned to the formal argument in the corresponding position. 
Any missing actual arguments are assiamed to have a null value. 
The variable whose name is the same as the function name is 
given a null value and the local variables are also assigned 
null values. 

The function is terminated by an exit to either RETURN 
or FRETURN. If the exit is to RETURN, then the function's 
value is the contents of the variable whose name is the same 
as the function name. If the exit is to FRETURN, then the 
statement vrtiich calls the function fails. In either case the 
saved values of the variable whose name is the same as the 
function name, the formal arguments, and the local variables 
are restored. It is quite permissible for a function to call 
other functions (before returning) including itself. Any exit 
to RETURN or FRETURN is a return from the most recent function 
call. The number of functions called which have not yet 
ret\irned is called the level of recursion. Every call of a 
function increases the level by one. Before any function has 
been called the level is l6. A function which is called at 
level n changes the level to n+1 and the return from the function 
is when the level changes from n+1 to n by a RETURN or FRETURN. 
An exit to RETURN or FRETURN at level X is an error . 

The following is the renowned factorial function. 

FACT FACTORIAL = GT(N,;eO N * FACT0RIAL(N - l) :S(RETURN) 

FACTORIAL = 1 -.(RETURN) 
A program which takes a niomber N >^from the teletype and outputs 
Nl is the following. 
START DEFINE ( 'FACTORIAL(N) * , 'FACT' ) 

OUTPUT » FACT0RIAL( INPUT) :(END) 
FACT FACTORIAL = GT(N^ N * FACT0RIAL(N - l) :S(RETURN) 
FACTORIAL = 1 .'(RETURN) 



3 
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The maximiom function of two integer arguments can be defined by 
the statement 

DEFINE ('MAX(X,Y)') 

and the function body can be 

MAX MAX = GT(X,Y) X :S(RETURN) 
MAX = Y : (RETURN) 
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Dlstinction Between Names 

The names of a variable, a function, and a label are 
distinct even when they are spelled the same. But there is the 
definite connection between the value of a function and the 
contents of the variable whose name is the same as the function's 
name. Also, it is common for the label of the first statement 
in the function to be the same as the fvinction name. 



O 
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Order of Evaluation 

The order of evaluation of a statement is extremely 
important in determining the effect of the statement's execution. 
The ordering is as given below and is left to right in all 
fields, except sis modified by the precedence of operators. 

1. The reference field is evaluated. If it fails, the 
statement fails. 

2. If there is a pattern field, it is evaluated. If it fails, 
the statement fails. 

Pattern matching is attempted. All immediate assignments 
(to be explained later) are done regardless of eventual 
success or failure of the pattern match. If the match fails, 
the statement fails. If the match succeeds, all assignments 
(to be explained later) are done regardless of eventual 
success or failure of the statement. 

3. If there is a replacement field, it is evaluated. If it 
fails, the statement fails; otherwise, the replacement or 
assignment is made. 

If any of the above fails, the statement fails; otherwise, 
the statement succeeds. 
k. The appropriate go- to field is evaluated. All function 
calls within the go-to field must succeed. 
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Patterns 

So far the only data object discussed has been a string 
(although some strings are integers). In this section we will 
introduce a new object called a pattern. Since a pattern is a 
data object, it may be stored in a variable. That is, assignment 
statements with a pattern in the right-hand side store the 
pattern in the variable on the left-hand side. So far we have 
met just one kind of pattern matching, i.e., a test of whether 
or not one string is a substring of another. For the purposes 
of this section, a string can sometimes be thought of as a 
pattern (although it is a data object of type string, not pattern). 
The general idea behind a pattern is that the pattern matches 
a number of different strings. It tries each of the possible 
matches against the subject string in some specified order. The 
first match is taken as the successful pattern match; the 
matched substring is replaced if there is a replacement field. 
If none of the possible substrings match, then pattern matching 
fails. Below are listed the available pattern elements and 
rules for combining them. 

Alternation ("OR") 

A pattern which can match whatever any one of a number of 
alternative patterns will match may be formed by using the 1 
binary operator. The operands are patterns (or strings). The 
'. operator has lowest precedence of all operators. If PI, P2, 
and P3 are patterns, then the pattern which will match whatever 
PI, or P2, or P3 matches can be written PI l P2 1 P3. First, 
PI is tried for a match; if it fails, then P2; if it fails, 
■I;;hen P3. 

If P = 'AA' '. 'AB' I 'AC, then P can match any of the substrings 
'AA', 'AB', or 'AC. 
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Concatenatlon 

A pattern may be formed by the concatenation operator (space) 
which can match the concatenation of strings matched by each of 
a number of patterns. If PI matches some part of the subject 
string and P2 matches some other part of the subject string 
such that the two parts are adjacent in the subject string, then 
PI P2 matches the concatenation of the two parts. 

If P = 'A' ('A' • 'B' I 'C'), then P can match an A followed by 
an A, B, or C. 

Arbitrary Strings 

The variable name ARB contains the primitive pattern \rtiich 
can match any number of characters. It first matches the null 
string (0 characters). If that fails, it will try one more 
character, etc . 

'A' ARB ('B' I 'C') 

can match substrings of the form A followed by any niimber of 

characters up to a B or a C. 

ARB ' , • can match any substring ending in a , . 

Balanced Strings 

The variable name BAL contains the primitive pattern which 
can match any non-null string of characters which is balanced 
with respect to the niimber of left and right parentheses. That 
is, it matches at least one character, and left and right 
parenthesis can be paired up such that every left parenthesis 
comes before the corresponding right parenthesis. 

Thus, BAL can match any of the following substrings 'ABC', 
'(XIZ)', '( )', •((AB)CD)' and not any of these ') (', 'ABC)', 
'((X)' . The following pattern match will succeed with ARB 
matching ')))(' and BAL matching ' ( ( ) ) ' , 
')))((( ))' ARB BAL. 

There are several primitive functions which will rettirn 
patterns as their value. 
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Fixed Length Strings 

The fxinction IiEN(N) requires an integer argxjment and 
returns as its value a pattern which can match any string of 
exactly N characters. 

'ABCDEPGH' LEN(3) 'G' 

Here LEN(3) matches VDEF' and 'G' matches 'G' . 

Fixed Positions In Strings 

The function POS(N) requires an integer argument and returns 
as its value a pattern which will match the null string immediately 
after the Nth character of the subject string. That is, it 
checks for the proper position in the subject string, in particular, 
P0S()6) will only match at the start of the subject string. 

Similarly, RPOS(N) will match the null string N characters 
from the end of the subject string. In particular, RPOS(0) 
will" match at the end of the subject string. ; ) 

SUBJECT POS(0) ML RPOS(0) 

This will succeed if the subject string is balanced with respect 
to parentheses since BAL is forced to match the whole string. 

Tabulation 

The fxinction TAB(n) requires an integer argument and returns 
as its value a pattern which will match all characters up to and 
including the Nth character of the object string. Similarly, 
RTAB(N) will match up to the last N characters. In particulaa:, 
RTAB(0) will match to the end of the subject string. 

Remainder 

The variable name REM contains the primitive pattern which 
will matph the remainder of the subject string. It is equivalent 
to RTAB(|Z5). 
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The following pattern match will succeed with TAB(4) matching 
'CD' and RTAB(2) matching 'EF' 

'ABCDEFGH' 'B' TAB(4) RTAB(2) 

In the following REM matches 'BABCBA' 

'ABCBABCBA' 'C REM 

Alternative Characters 

The function ANY(S) requires a string argument and returns 
as its value a pattern which will match any character which 
is in the string S. Conversely, NOTAlfl(S) will match any 
character which is not in S. 

Runs of Characters 

The function SPAN(S) requires a string argument and returns 
as its value a pattern which will match a string coniposed of 
characters which are in the string S. It will not match the 
null string, i.e., it must match at least one character. It 
will not match a string of characters if the run of characters 
from S can he lengthened, i.e., it matches up to the first 
character not in S or else the end of the subject string. 
Conversely, BREAK(S) will match characters which are hot in S 
up to the first character which is in S. It can match the null 
string and will not match if a break character cannot be fovind. 

Let X = 'ABCDEFGHIJKLMNOPQRSTUVWXYZ01231+56789' then the pattern 
P = (POS(jZi) : NOTANY(X)) NAME (NOTANY(X) 1 RPOS(0)) will match 
successfully if there is an occurrence of the string NAME in the 
subject string which is not preceded or followed by an alphanumeric 
character . 

, '123ABCr)456' SPAN('ABCDEFGHIJKIMNOPQPSTUVWXYZ') 
Here the SPAN matches 'ABCD' . 
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Repetitlons 

The function AEBNO(P) has a pattern argument and returns 
as its value a pattern which matches any string that would be 
matched by an arbitrary number of consecutive occixrrences of 
the pattern P. It first matches the null string. It is equi- 
valent to the pattern X where X = " ! P *X (the * operator 
will be defined later). That is, if it ever matches n P's, 
then it will try n+1 P's next. If the n+lst P fails to match, 
it will try more cases of n P's, if any. 

'ABCDEPGHIJKL' POSC^O ARBN0(LEN(3)) RPOSObO 

will match the complete subject string since it is of length 
12 = 3*^. 

Signaling Failure 

The variable name FAIL contains the primitive pattern which 
will always fail to match. The variable name FENCE contains 
the primitive pattern which will match the null string, but 
if tried for alternatives (rematch), it will cause pattern 
matching to completely fail. The variable name ABORT contains 
the primitive pattern which will cause pattern matching to 
completely fail. 
No matter what pattern P is, the following will always fail: 

SUBJECT P ABORT 
This succeeds 

'AB' 'A' FENCE 'B' 
This fails 

'ACAB' 'A' FENCE 'B' 
The FAIL alternative in the following is superfluous 

'ABC 'A' (FAIL I 'B' '. 'C') 
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The Order of Pattern Matching 

A pattern is made up of subpatterns which are combined by 
concatenation and alternations. The primitive patterns are 
the contents or else the returned values of: strings, ARB, BAL, 
LEN(N), POS(N), RPOS(N), TAB(N), ETAB(N), REM, AWY(S), NOTANY(S), 
SPAN(S), BREAK(S), ARBNO(p), FAIL, FENCE, ABORT. There are 
four states of the pattern matching process that are of interest: 
match, success, fail, rematch. Success and failure here have 
little to do with success and failure of the statement. These 
are local states of the pattern matching process. The previous 
sections state what each of the primitive patterns will first 
match. If for some reason a match of an element does not work 
out later on, it is tried for a rematch. Most of the primitive 
elements fail to rematch. ARB, BAL, ARBNO(P) can be tried to 
rematch. (FENCE aborts all matching on rematch.) ARB and 
ARBNO(P) first match the null string. BAL first matches a 
substring of one character, or else more, if the first character 
was a '('which needs to be balanced. On rematch whatever ARB 
has matched it extends that by one character. If no characters 
remain in the subject string, then ARB fails to rematch. What 
BAL has matched, on rematch it will try to extend that by 
another balanced substring of one or more characters. Failing 
that, rematch fails. ARBNO(P) on rematch tries to extend 
whatever it has matched by whatever another P will match. 
Failing that, it will rematch the previous P's. 

A match of a concatenation is attempted by trying to match 
its first operand. If that succeeds, it will try its next 
operand. If all operands eventually succeed, then the concatena- 
tion succeeds. If any operand fails to match, then the previous 
operand is tried for a rematch. If the first operand fails, 
then the concatenation fails to match. If a concatenation must 
be rematched, then the last operand is rematched, etc. 

A match of an alternative is attempted by trying to match 
the first operand. If that succeeds, the alternation succeeds. 
If it fails, then the next operand is tried. If all operands 
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fail to match, then the alternation fails to match. If an 
alternation must be reaatched, the operajad that ^.s matched 
last is rematched. If this fails, then the next operand is 
tried for a match, etc. 

The matching process begins with the first character of 
the subject string. Each primitive pattern elen^nt that 
matches extends the substring that has been matched* If pattern 
matching falls using the first character of the subject string, 
a pattern match is attenrpted starting with the next character 
in the subject string, and so on, until there are no more 
characters in the subject string at which to try to start a 
match. If pattern matching succeeds, it will have matched some 
substring of the subject string which can be replaced if the 
statement contains a r«placenffint field. If pattern matching 
fails, then all possible substring matches of the subject 
string have failed to match. Coa5>lete failiare of the pattern 
matching process causes the statement to fail. It is possible 
to set a mode where only matches which include the first 
character of the subject string (or no characters at all) are 
atten^ted. This mode can be set by assigning a negative integer 
to the kejrword (SANCHOR (see keywords). 
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Deferred Pattern Definition 

Patterns can be stored into «aaeiB-. When the nam e is used, it 
is just like using the pattern that was stored in the jwase. In 
particular, when a pattern is defined, it may be defined in terms 
of other patterns. When a pattern is constructed (defined), 
the cxxrrent values of its components axe used. Consider the 
following statements: 

P = -RPOSip) 

Q = FOSip) ARB P 

P = RPOS(l) 

SUBJECT Q 

What is the pattern Q that is used? When Q was defined, it 
became the pattern FOS{P) ARB RPOS(0) and it has not been 
redefined. 

At times it is desirable to define a pattern in terms of 
(^ ) another pattern without the value of the other pattern being 

defined yet. The unary operator *, when applied to a name (the 
name is evaluated at definition time), says to use the pattern 
given by the name whenever a match of this pattern is attempted. 
Because * operates on a name to yield (eventually) a pattern, 
the * operator may only appear where a pattern is allowed. 
In particular, it csinnot be used where an integer or string 
argument is expected, i.e., SPAN(*S) and POS (*N) are illegal. 

Altering the previous example a bit, now what is the 
pattern Q that is used? 

P = RPOS(ji) 

Q = POS (0) ARB *P 

P = RPOS(l) 

SUBJECT Q 
Q is P0S(jZ5) ARB *P which here is equivalent to POS(0) ARB RPOS(l). 

The * operator can be used to yield recursive pattern 
definitions . 

p = 'B' I *P 'C can match any of the following substrings 

•B', 'BC', 'BCC, 'BCCC, etc. 
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Value Assignment 

When a pattern successfully matches, it is possible to 
assign the substring matched by any contponent (subpattern) of 
the pattern to a variable. The binary operator . is used to 
indicate value assignment in case the pattern successfully 
matches. Its left operand is a pattern (or subpattern) and 
its right operand is a name. 

If pattern matching is successful and the subpattern was 
part of the successful match, then the substring that the 
subpattern matched will be assigned to the variable with the 
given name. If the subpattern was not part of the successful 
match, then no assignment is made. It is possible to do multiple 
assignments like ARB . X . Y. If the ARE was part of a successful 
match, then X and Y would receive the same value. Assignments 
are made left to right; thus, if two assignments are made to 
the same name, the last assignment would be the right most 
assignment. 

Consider the following pattern matches. 

'ABCDEFGH' (ARB . X 'P') • Y 

The pattern match succeeds resulting in the assignments 

X = 'ABCDE', Y = 'ABCDEF' . 

" 123^)56789' (1 . X l '2' . Y) LEN(3) . Z 

The pattern match succeeds resulting in the assignments 

X = '1', Z = '23^4-' . Y will retain its previous value. 

Suppose STRING = 'AB,CD,EF' . 

STRING ',' ARB . X ',' = ';' X ';' 
The pattern match will succeed with X = 'CD' and will result 
in STRING = 'AB;CD;EF' . 
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Immediate Value Assignment 

The value assignment described in the previous section 
occurs only on successful completion of pattern mat<t!hing. It 
is also possible to assign a substring matched by aijiy coianponent 
(subpattern) of a pattern vftienever that component successfully 
matches during the pattern matching process, regardijLess of the 
eventual success or failure of pattern matching. Tlfie binary 
operator $ is used in the same way . is used, except that 
assignments are immediate. Immediate assignment can be combined 
with deferred pattern definition, so that whenever a variable 
is assigned a new string value by immediate assignment the 
deferred pattern of the same name becomes a pattern which 
matches the new string value. Due to the fact that the pattern 
matcher will signal failure as soon as it knows it ^.s no use 
trying any more possible matches, immediate assignments may not 
/""^j always have their expected final value when the subpattern is 

not part of a successful pattern match. In order tp assure 
that the pattern matches will try all possible matches, the 
keyword QFULLSCAN can be set (to -l) . 

Consider the following pattern matches. 

'BABCABCD' BAL $ Z *Z 
The pattern match succeeds with Z = 'ABC' . The pattern 
BAL $ Z *Z matches only substrings of length 2 or larger in which 
the first half of the substring is identical to the second half 
sind is balanced with respect to parentheses. 

'ABC ('A' $ X I 'B' $ Y) 'D' 
The pattern match fails but X and Y are assigned new values. 
X = 'A', Y = 'B* since an A and a B occxu-jjf in the string. 
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Infinite Loops 

The pattern matcher is sophisticated enough to prevent all 
infinite loops (due to recursive pattern definitions) . When an 
infinite loop is detected, the matcher will know that it is 
useless to try to match some deferred pattern and will signal 
that the match of that deferred pattern fails, thus seeking 
alternative rematches. Suppose X = 'A' '. *X 'B' then, taken 
literally, the following pattern match would go into an infinite 

loop. c_j(;wflD6>'5 

'C (*X ! 'C') 
That is, X is first tried for a match. 'A' fails to match, 
therefore, the alternative is tried. The first thing in the 
alternative is a match of the current value of X. Thus, a 
second attempt to match X is made. 'A' fails to match; therefore, 
the alternative is tried. The first thing in the alternative is 
a match of the current value of X. Thus, a second attempt to 
match X is made. 'A' fails to matchj therefore, the alternative 
(to this instance of X) is tried. The first thing in the 
alternative is to match another X. And so it goes. The fact 
is that the pattern matcher catches this loop quite easily, 
signaling failure at the second instance of X. Thus, the first 
instance of X also fails. The alternative 'C is now tried and 
the pattern match succeeds. In particular, such patterns as 
X = -^X will always fail. 
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Efficiency in Patterns 



The following eight ideas on more efficient patterns are 
not exhaustive but cover many of the most coimnon or most costly 
cases of inefficiency. 

1. A pattern that could be anchored should be anchored if 
it can possibly fail and thus try many extra unanchored 
matches. Use FEHCE or POS(0) to anchor the pattern. 

2. ARBHO is relatively slow. It is much preferable to find 
another construction if possible (without resorting to 
deferred patterns). For example, in most cases, AHBNO(' ') 
is best replaced by SPAN(' ') : '' 

3. In many patterns a BREAK or SPAN can be used instead of ARB. 
In such places it is usually preferable to use such a 
construction since BREAK and SPAN are extremely efficient. 

k. Such constructions as '.' '. ',' are best replaced by 
ANY('.,') 

5. Immediate value assignment should be avoided if possible; 
otherwise, many superfluous assignments may be made during 
pattern matching. 

6. If possible, do not use FULLSCAN mode, 

7. When using deferred patterns, avoid left recursion and 
other associated inefficiencies. The pattern matcher can 
catch infinite recursion but it can be extremely expensive 
in time . 

8. It is very important that patterns be constructed once 
instead of everytime the pattern is to be used,. Constant 
patterns are best defined once and for all in the beginning 
of the program outside of program loops. This: is done by 
assigning the pattern to a variable and using %he variable 
wherever that pattern was to be used. 
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Additional Built-in Functions 



ustegieCx) 

TRIM(S) 
DATE() 

TIMlO 



CLOCK(I) 



1DENT(S,S) - 
DIFFER(S,S)- 

UMSTACIC(k) 



Returns a null string if X is an integer string, 

otherwise, it fails. 

fakes a string argument and returns thf same string 

vdth trailing blanks removed. 

Takes no argument and returns an 8 chai'acter string 

Ttftiich is the current date. 

Format MM/BD/YY 

Takes no argument and returns a 7 character string 

which is the cturrent time according to a 2i|-hour 

clock. 

Format HHMMiSS . 

Takes an integer string argument. If jbhe argument 

is P, it returns the el^sed time counter (BRS 88). 

If the argument is not 0, it retxarns the real time 

counter (BRS k2). Both counters are in units of 

1/6^ of a second. 

Compares two string arguments and returns the null 

string if they are identical, otherwise, it fails. 

Cc»ffipares two string argun»nts and returns the null 

string if they are not identical, othej-wise, it 

fails . I 
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Additlonal Input-Output Facilities 

INPUT, OUTPUT, INPUTC, OUTPUTC have been introduced for 
teletype input-output by line or characters. The following 
explains how to associate other string names with file input- 
output . 

To communicate with a file it must be opened. A file is 
opened by calling either OPENIN or OFENOUT depending on whether 
it is to be an input or an output file. These i^inctions require ' 
a single argument which is a string containing a complete file 
name. The returned value is a file number which is used to make 
references to that file. 
tv /i/A-' C^ '^° facilitate obtaining file names, the string name FILENAME 

— -f---- J when used like INPUT prints "FILE NAME" on the teletype and 

/^^^^^//AjL ^ collects a file name. Thus, the value of FII^NAME will be a 
"(MP^lLt f^ /complete file name. 

p^ijDM<t -l^^ Associations between string names and files is done by the 

0UAill''O^TF'IL''6 ASSIGN function, 'irtiich requires two arguments. The first 
.£fc/i A^ Ift-tdi' argument contains the string name, the second contains the file 
va . ' number. When a string name is assigned to an input file, all 

other assignments are voided. When it is assigned to an output 
file, other output assignments remain. 

Input and output can be by line or character. Initially, 
a string name which is assigned to a file is of type line. It 
can be changed to type character by calling the function CHAR 
which requires one argument, the string name. Similarly, it 
can be changed back to type line by calling the function LINE. 
New assignments to the string name will not change the mode. 

Character input is just one character. Line input from a 
file reads everything up to a carriage return, linefeed which 
is discarded. Line input from the teletype is in the same 
format as source statements. The resulting string has every 

<- deleted and has all & codes translated. End of file 

c 
(D as first character of teletype line) causes failure of 

the statement. 
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Character output is literal output of the contents of the 
string. Line output supplies needed carriage retxirhs and line- 
feeds and recognizes the line length of the output file. 

The line length of an output file is initially set to 72 
T*ien the file is opened. It can be changed by calling the 
fiinction, LENGTH, which requires two arguments. The first 
argument contains the file number, the second contains the new 
line length of the file, (if the second argument iS ff ox null, 
the new line length will be i?^. ) ^~" JL ~ ^ 

To release all input or output assignments associated with 
a string name, call the fiaiction RELEASE with the argument 
containing the string name. 

To close a file call CLOSE with the file numbet as the 
argument. A negative argument will close all files. 



Examples: 

N = opENiN(niiimtrn) 

ASSIGN('IN',N) 

CHAR('IN') 

M = OEENOUTC'/KEWFILE'") 

ASSIGN('0UT',M) 

ASSIGN('0UT',1) 

CHAR('OUT') LII]E('OUT') 

LENGTH(M,128) 

OUTPUT = INPUT :F(E0F) 

RELEASE (•OUT') 

CLOSE (N) 

CLOSE (-1) 
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The Editor, Compiler, and Runtime 

The SDS <^P SNOBOlJ'^ system is divided into two distinct 
parts: The editor -compiler and the runtime. The editor-compiler 
is used to write, modify, and compile source statements. The 
runtime is responsible for the execution of statements. 

The editor types % when it is ready for commands. The 
editor is in most ways like QED. Familiarity with QED is 
required to use the capabilities of the editor. The editor 
commands which are similar to Q^D commands are /, =, *-, APPEND, 
CHANGE, DELETE, EDIT, FINISHED, INSERT, MODIFY, QUICK, READ FROM, 
SUBSTITUTE, TABS, VERBOSE, WRITE ON. Additional commands are BREAK, 
GO, HELP,'* KILL, LIST,'^ PROCEED, and space followed by a SNOBOL 
statement (which cannot be a comment nor be labeled) to be 
immediately executed. Soxxrce statements can be read from a file, 
or using APPEND may be typed in directly. As each statement is 
read or typed, it is compiled. If there is an error, one edits 
the statement immediate]^. All standard QED addressing can be 
used; however, buffer operations are not available for addressing 
and editing. One other difference between QED and the editor 
is that every line typed in is an edit of the previous line typed 
or deleted. Qn.Q consequence is that control D is a terminator 
only when no characters axe in the new line. The QED commands 
will not be explained (see the QED manual); the editing control 
characters are summarized in Appendix B. 
GO - begins execution of the SNOBOL statements after closing 

all open files and clearing all variables and resetting preset 
variables and functions. " — OK" is printed out as a waxning; 
respond with "." . The first statement executed is given 
by the address of the GO. If no address is given, then 
execution begins at the first statement. 
EEIEAK - sets up breakpoints at all statements in the interval 

addressed. A break at a statement is made before executing 
that statement and returns control to the editor. 
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T^M? - i-"^"^ l^ <SO; Ld(- >uryrk»<j:^ ;>vtJj^oi^ei:MVv AaXa. 
KILL - releases all breakpoints in the interval addressed. 
LIST - prints all breakpoints in the interval addressed. 
PROCEED - continues execution after a breakpoint. 

A single rubout dxiring execution will cause a break at the 
start of the next statement.. (Remember that to complete the 
current statement all teletype input must be completed. You 
may also have to wait until the teletype output buffer is 
empty before seeing where the break was done; this buffer may 
have as much as 15 seconds worth of typing in it . ) 

A second rubout will return to the editor immediately 
(before finishing the statement). It is not possible to proceed 
in this case. 

An unlabeled statement may be executed while in the editor 
by typing it in. Of course, it must start with a space. In 
particular, branches (goto's) are legal. This is the way to 
begin execution without the side effects of the GO command. 

The following are equivalent ($ printed by the editor) 

$G0. 

$1G0. 

To set a breakpoint at every statement type 

$<9BREAK. 

Or to kill all breakpoints type 

$<aKILL. 

PROCEED and SBISt iHMt use/ addresses . 

lEROCEED. 

$LIST . 

This is an example of a SNOBOL statepieat line. 

$ OUTPUT = INPUT 
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Special Operations 

The source statements naist be written in printable characters. 
To enter non-printable characters, e.g., into a string, type & 
followed by 3 octal digits, e.g., Sel55, or else & followed by 
a non-octal character, e.g., M; however, in the latter case, 
characters p to 37 will remain unchanged, the others (ij-0-77) 
will become the corresponding control characters. Kote that 
& may only be entered by typing && or &OO6. To aid the above, 
the 1^ editing character produces four characters, &XXX, where 
XXX is the octal code of the next character typed. 

Continuation of a statement is possible by typing «- at 
the end of the line to be continued. This character is entered 
into the Source string and is treated like a blank ^en ¥ is 
used, but is ignored by the compiler. Therefore, be sure to 
type any needed blKiks in the source statement. In the editor 
typing linefeed is equivalent to typing «- . 

All teletype line input is edited and is subject to the 
same rules as a sotirce statement except that «- is deleted and 
& codes are translated into internal form. D in the first 
character position causes failure of the statement (due to end 
of transmission). 

Statements which begin with an * are treated as comments. 
Comments have no affect on the execution of the program. 
Execution is done the same as if the comments were not present. 
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Keywords 

Keywords provide an interface between the SNOBOlA program 
and certain internal symbols in the SNOBOLU system. It is 
expected that additional keywords beyond those listed below will 
be implemented. 

Read-only keywords: 

OSTCOUNT contains the mimber of statements that have been 

entered since execution began. 
©STFCOUNT contains the number of statements that have failed. 
©LEVEL contains the current level of recursion. 

User changeable flags (a flag which is non-negative is off, 
negative is on) : 

©ANCHOR if on, sets the mode of pattern matching to anchored, 
that is, all patterns must match beginning with the first 
chsiracter of the subject string. 

@FULLSCAN if on sets the mode of pattern matching to try all 
possible matches regardless of the impossibility of ever 
matching (i.e., no heuristics to speed up pattern matching). 

Changeable limits: 

©MAXLWGTH is the limit on the length of strings that can be 
formed. It is preset to Z'^.^pfi which is the largest it 
can be set. 

dSTLIMIT is the limit on the number of statements that can 
be executed. It is preset tof^mV 2 — i ( *-^ W>^i<y\Awvv^ 

®INTLIMIT is the limit on the msiximxim absolute value an 

integer can be. It is preset to ttMH#( and can have a 

\ 23 ^ 

majcimum of ^2 -1. 
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Literals (these keywords have predefined values and are unchangeable): 

9ALPHABET contains 'ABCDEFGHIJKIMNOPQiRSTUWffiYZ' 
©DIGITS contains '0123456789' 

The following have the same values that the corresponding 
predefined pattern variables initially have. 

(9ARB 

®ML 

@REM 

OFAIL 

®FENCE 

©ABORT 



Pseudo-Teletype Functions 

There are a number of predefined functions which enable 
communication with a pseudo- teletype . The list includes 
LOGIN( ), L0GOUT( ), WAIT( ), SEND(S), ATSEKD(S), R?CV(W), and 
RECVLINEC ). Additionally, there is the ECHO(N) function which 
can be used independently of the pseudo-teletype furiictions. 
The conditional coramajad processing capability produced by the 
combination of the pseudo-teletype fvmctions and the SWOBOL 
language has been inspired by the CCP subsystem. 

LOGIN( ) or L0GIN(MME, PASSWORD) 

The LOGIN function may be called with either two arguments 
or no arguments. The LOGIN fiinction attempts to log in (ENTER) 
the user at a pseudo- teletype either under his name or under 
another name. If two arguments are given, the first is a user 
name and the second is the corresponding password t(j) be used in 
entering at the pseudo- teletype. If no arguments a^-e given, the 
user is logged in under his own name and password. If LOGIN is 
successful, it returns a null string as its value aijid places the 
pseudo-teletype in BEGINNER mode at the EXECUTIVE l^vel with 
the input and output buffers clear. If LOGIN fails i it is due 
to one of the following reasons: 

1. No pseudo- teletype is free. 

2. No room on the time-sharing system. 

3. The user name or password is incorrect. 

It is an error to try to LOGIN if a previous LOGIN 1(k&s not been 
logged out. It is also an error to call WAIT( ), SEND(S), 
ATSEND(S), RECV(n), or RECVLINE( ) if LOGIN has not been 
successfully called. 
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K)GOOT( ) 

The K)GOUT function is used to log out the pseudo-teletjrpe . 
The user is automatically logged out (if logged in) whenever the 
GO or FIIISEED ccxnmand is used in the SNOBOL editor. If the 
LOGOUT function succeeds, it will return a null value. It will 
fail if the user is not logged in at a pseudo- teletype . 

WArr( ) 

The WAIT function always succeeds and returns the null 
string as its value, but hefore returning it waits until the 
pseudo-teletype is waiting for the teletype input with an empty 
input huff er. While it is waiting, it throws away all output 
from the last SIND or ATSEND function call. Also, before 
returning it clears the pseudo-teletype output bxiffer. 

SEND(S) 

The SEND function requires one ai'gument which must be a 
string. SEND first does a WAIT, then sends the characters of 
the argument string to the pseudo- teletype , An error results 
if the internal collection buffer (about 6000 characters long) 
of characters from the pseudo-teletype overflotw before all 
the characters are sent. The SEND function succeeds with a 
null string as its value. 

atsemdCs) 

The difference between ATSEND and SEND is that ATSEND does 
not do a WAIT,' instead it sends imbouts to the pseudo-teletype 
to get it back to the EXECUTIVE level. !feny times the user 
may want to wait until the pseudo-teletype is waiting for input 
or to receive all the output from the pseudo- teletype before 
preceding with an ATSEND. To do this either do a WAIT( ) or 
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enough RECV's or RECVLINE's to collect all the output. Both 
SEND and ATSEITO initialize the collection machinery, that is, 
previous output from the pseudo-teletype is discarded. 

RECV(N) 

RECV(n) takes an integer argument (N > 0). It collects literally 
the next X characters (X not greater than N but otherwise as 
large as possible) from the pseudo- teletype output ifesulting 
from the last SEND or ATSEND, If X is <fi, RECV fail$, indicating 
that all output from the last SEND or ATSEND has alrteady been 
collected. If X is greater than 0, RECV succeeds ir|i returning 
the X characters as its value. It should be noted tfhat after 
any SEND or ATSEND, at most one call of RECV(N) can successfully 
return with less than N characters. Also, the function fails 
only if the pseudo-teletype is waiting for input. 

RECVLINE( ) 

The RECVLINE function is used for receiving the output 
from the pseudo-teletype by line. The algorithm is that the 
first character is ignored if it is a linefeed, therji all 
characters up to a carriage return are collected and returned 
as the value of RECVLINEj the carriage return is discarded. 
If the output from the pseudo-teletype does not contain a 
carriage return, then all the remaining characters are returned, 
unless the returned string would be the null string, in which 
case RECVLINE fails. RECV and RECVLINE can be intermixed. 

The following will print the same thing as what would 
appear on the pseudo-teletype except it outputs an ^xtra 
carriage return, linefeed in the case where the last line from 
the pseudo-teletype does not terminate with a carriage return, 
linefeed. 
OUTLOOP OUTPUT » EECVLIKE( ) :S(OUTLOOP) 
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ECHO(W) 

The ECHO function requires an integer string argument. 
It succeeds and returns the null string as its value. ECHO 
is used for tiirning on and off the echoing of characters typed 
during teletype input in the running of a SNOBOL program. If 
the argument is negative, the echoing of characters is turned 
off, if non-negative, it is turned on. Turning off the echo may be 
of use in collecting passwords. Also, it is of use in preventing 
a double echoing effect that would exist in the first sample 
program if the echo was not txirned off. 
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Sample Pseudo-Teletype Programs 

* This program implements a direct interaction with the 

* pseudo-teletype . 

* If a control «- is typed, a rubout is sent to the 

* pseudo- teletype . 

* If a control t is typed, the pseudo- teletype is logged out. 

BEGIN LOGIN( ) 

ECHO(-l) 
LOOPA A = INPUTC 

IDEI}T(A, '&t') LOGOUT( ) :S(EM)) 

IDENT(A, •&<-) SEND('&137') :S(L00EB) 

SEND (A) 
LOOPB OUTPUTC = RECV(#1AXLNGTH) :S(L00PB) F(LOOPA) 

* This program does a fixed assembly, load and a dump on a 
^ specified file (second mmmUm). 

* Output from the pseudo-teletype also goes to a specified 

* file (first ^iOHNWS). 

BEGIN DEFINE ( 'XMIT(X) ' ) 

N = OPENOUT(l^i^) 

ASSIGN('OUT',N) 

LQGIN( ) 

XMITC'KDF.RSI. '/SI' .RS2. '/S2' .F.") 

XMIT("NAEP./S1.'/B1'.") 

XMIT("NARP./S2.'/B2'.") ^ ^ 

XMIT("DDT.;T/B1.; T/B2.%FDUMP ON FILE" lilljjllifl ".") 

FINI CLOSE(N) LOGOUT( ) :(END) 

XMIT SEND(X) 

XMITl OUT = RECVLINE( ) :F(RETURN) 

OUT '?' :F(XMIT1) 

OUTPUT = 'TERROR' :(FINl) 
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APEEKDIX A 

Primitive Functions 
ANY(S) 

notanyCs) 

SPM(S) 

BREAK(S) 

POS(I) 

RPOS(I) 

TAB(I) 

RTAB(l) 

LEW(I) 

ARBNO(P) 

SIZE(S) 

le(i,j) 

LT(I,J) 
GE(I,J) 
GT(I,J) 
EQ(I,J) 
KE(I,J) 
CHAR(S) 
LIME(S) 

oeenin(s) 
openout(s) 

CLOSE (I) 

ASSIGN(S,I) 

RELEASE (S) 

LENGTH(I/J) 

DEFIlffi(&,S) 

INTEGER(X) 

TRIM(S) 

DATE() 

TIME() 

CLOCK(I) 

IDEWT(S,S) 

DIFFER(S,S) 
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(Appendix A Continued) 

Variables With Preset Values 

ARB 

BAL 

REM 

FAIL 

FENCE 

ABORT 

Special Input -Output Variables 

INPUT 
OUTPUT 
INPUTC 
OUTPUTC 

Reserved Labels 

RETURN 

FRETURN 

END 
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Operator Precedence Table 
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) 



binary 



(lowest) 
(space) 



unary 



.,$ 
+,- 



(highest) 



SNOBOlA Syntax 

statement=[ label] [' '[semi] [' '][':• • ':' goto]][' '] 

semi=anam[[' '] '=' [' •] pexp] I anam • ' [patt [' ']['=• [' •] pexp]]: 

aatm[ ' ' ] [patt ] 
patt=non-null pexp 
goto=[' •]'?• •(• gexp ')• [' ']['S' •(' gexp •)'] i 

[' ']'S' •(• gexp •)' [• ']['F' •(' gexp ')'] • 

[• •] •(• gexp •)• 
gexp=[* ']('$' aatm 1 label) [' '] 
pexp=ptrm| ' ' ". ' ' ' ptrm| 

ptrm=pprm-|' ' pprmj I null 

pprm=patm|' * ('.' '. '$') • ' anam| 

patm='*' anam ! bexp I 
bexp=atrml' ' ('+' l •• 

atrm=aprm|' ' ('*• I •/•) 

aprm=aun[ ' ' '*' '*• • ' aprm] 
aun='+' aatm I '-' aatm 1 aatm 
aatm='$' aatm t lit I name I fncl '. 

•(' [• •] aexp [• •] •)• 
fncl=fnam '(• [' •][pexp[' ']■[','[' •] pejcp [' ' ]]-] ')• 

aexp=bexp|' ' bexp| 

anam='$' aatm ! name 



( ' [ ' ' ] pexp [ • ' ] • ) • 
) ' • atrmj 



' ' aprmj- 
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(Appendix A Continued) 

Keywords 

©ABORT 

©ALPHABET 

©ANCHOR 

©ARB 

©BAL 

©DIGITS 

©FAIL 

©FENCE 

©FULLSCAN 

©INTLIMIT 

©LEVEL 

©MAXLNGTH 

©REM 

©STCOUNT 

©STFCOUNT 

©STLIMIT 



Pseudo-Teletype Functions 

LOGIN(NAME, PASSWORD) 

LOGOUT ( ) 

WAIT( ) 

SEND(S) 

ATSEra)(S) 

RECV(N) 

RECVLIHE( ) 

ECHO(il) 
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APPEHDIX B 
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EDITING COMTROL CHAEACTERS 



Control Chajacter 

A 

B 

C 

D 

E 

F 

G 

H 

I 

J, LINEFEED 

K 

M, CARRIAGE RETURN 

N 



P 

Q 

R 

S 

T 

U 

V 

W 

X 

Y 

Z 



Result 

Delete last character typed 

No action 

Copy character from old linte 

Terminate or copy rest of line 

Change insert-replace mode 

Copy rest of old line, no tjyping 

No action 

Copy to end of old line 

Tab 

Continuation 

Bftggjpaijjillt Give Cade t®r i^e'^i-f cWfi -kp^c^. 

Delete line 

Terminate statement 

Character delete, restoratiye 

Copy up to next character tVped 

Skip up to next character typed 

Delete statement, restoratiVe 

Retype, fast 

Skip character 

Retype, aligned 

Copy up to next tab stop 



Take. /ie^+ cMAi (fiem-tl* 



Delete word 

Skip through next character 

Concatenate and re-edit 

Copy through next character 

Continuation 



) 
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